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We present measurements of the top quark mass (mt) in candidate events with two final state 
leptons using 1 fb~^ of data collected by the DO experiment. Our data sample is selected by requiring 
two fully identified leptons or by relaxing one lepton requirement to an isolated track if at least one 
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jet is tagged as a & jet. The top quark mass is extracted after reconstructing the event kinematics 
under the tt hypothesis using two methods. In the first method, we integrate over expected neutrino 
rapidity distributions, and in the second we calculate a weight for the possible top quark masses 
based on the observed particle momenta and the known parton distribution functions. We analyze 
83 candidate events in data and obtain mt = 176.2 ± 4.8(stat) ± 2.1(sys) GeV and rrit = 173.2 ± 
4.9(stat) ± 2.0(sys) GeV for the two methods, respectively. Accounting for correlations between the 
two methods, we combine the measurements to obtain mt = 174.7 ± 4.4(stat) ± 2.0(sys) GeV. 

PACS numbers: 12.15.Ff, 14.65.Ha 



I. INTRODUCTION 

After the top quark was discovered in 1995 [H, em- 
phasis quickly turned to detailed studies of its properties 
including measuring its mass across all reconstructable 
final states. Within the standard model, a precise mea- 
surement of the top quark mass (mt) and W boson 
mass {M\y) can be used to constrain the Higgs boson 
mass {Mfi). In fact, these masses can be related by 
radiative corrections to Mw- One- loop corrections give 

= si„^g(^'^^^r) , where Ar depends quadratically on 
mt and logarithmically on Mh 3 • Beyond its relation to 
Mh, the top quark mass refiects the Yukawa coupling, Yt, 
for the top quark via Yt = where v = 246 GeV 

is the vacuum expectation value of the Higgs field ^j]. 
Given that these couplings are not predicted by the the- 
ory, Yt = 0.995 ± 0.007 for the current mt Q is curiously 
close to unity. One of several possible modifications to 
the mechanism underlying electroweak symmetry break- 
ing suggests a more central role for the top quark. For 
instance, in top-color assisted technicolor [0, 0| , the top 
quark plays a major role in electroweak symmetry break- 
ing. These models entirely remove the need for an ele- 
mentary scalar Higgs field in favor of new strong interac- 
tions that provide the observed mass spectrum. Perhaps 
there are extra Higgs doublets as in MSSM models Q; 
measurement of the top quark mass may be sensitive to 
such models (e.g., Ref. ^). 

In the standard model, BR{t — + Wb) is expected to 
be nearly 100%. So the relative rates of final states 
in events with top quark pairs, ti, are dictated by the 
branching ratios of the W boson to various fermion pairs. 
In approximately 10% of ti events, both W bosons de- 
cay leptonically. Generally, only events that include the 
W ^ ev and W ^ jjLv modes yield final states with 
precisely reconstructed lepton momenta to be used for 
mass analysis. Thus, analyzable dilepton final states are 
tt W -\- uv' + bb, where i, £' = e, ji. We measure mt 
in these dilepton events. The W ^ tv ^ e{fi)i>P decay 
modes cannot be separated from the direct W — *■ e{pL)v 
decays and are included in our analysis. 

Dilepton channels provide a sample that is statisti- 
cally independent of the more copious ti ^ -\- qq' + bb 
{i+iets) decays. The relative contributions of specific 
systematic effects are somewhat different between mass 
measurements from events with dilepton or £-|-jets fi- 
nal states. The jet multiplicity and the dominant back- 
ground processes are different. The measurement of mt 



in the dilepton channel also provides a consistency test 
of the ti event sample with the expected t Wb decay. 
Non-standard decays of the top quark, such as i ^ H'^b, 
can affect the final state particle kinematics differently 
in different ti channels. These kinematics affect the re- 
constructed mass significantly, for example in the £-|-jets 
channel [l3|- Therefore, it is important to precisely test 
the consistency of the mt measurements in different chan- 
nels. 



Previous efforts to measure mt in the dilepton chan- 
nels have been pursued by the DO and CDF collabora- 
tions. A frequently used technique reconstructs individ- 
ual event kinematics using known constraints to obtain 
a relative probability of consistency with a range of top 
quark masses. The "matrix weighting" method (MWT) 
follows the ideas proposed by Dalitz and Goldstein jTl| 
and Kondo [l2|- It uses parton distribution functions 
and observed particle momenta to obtain a mass esti- 
mate for each dilepton event, and has previously been 
implemented by DO [H, [13. The "neutrino weighting" 
method (z^WT) was developed at DO [11]. It integrates 
over expected neutrino rapidity distributions, and has 
been used by both the DO [fl, [3 and CDF collab- 
orations. 



In this paper, we describe a measurement of the top 
quark mass in 1 fb~^ of pp collider data collected us- 
ing the DO detector at the Fermilab Tevatron Collider. 
Events are selected in two categories. Those with one 
fully identified electron and one fully identified muon, two 
electrons, or two muons are referred to as "2£." To im- 
prove acceptance, we include a second category consist- 
ing of events with only one fully reconstructed electron 
or muon and an isolated high transverse momentum (pt) 
track as well as at least one identified 5 jet, which we refer 
to as "£-|-track" events. We describe the detection, selec- 
tion, and modeling of these events in Sections [TTl and [1111 
Reconstruction of the kinematics of ti events proceeds by 
both the MWT and vWT approaches. These methods 
are described in Section IIVI In Section |Vl we describe 
the maximum likelihood fits to extract mt from data. Fi- 
nally, we discuss our results and systematic uncertainties 
in Section IVll 
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II. DETECTOR AND DATA SAMPLE 
A. Detector Components 

The DO Run II detector jTgl is a multipurpose collider 
detector consisting of an inner magnetic central tracking 
system, calorimeters, and outer muon tracking detectors. 
The spatial coordinates of the DO detector are defined as 
follows: the positive z axis is along the direction of the 
proton beam while positive y is defined as upward from 
the detector's center, which serves as the origin. The 
polar angle B is measured with respect to the positive z 
direction and is usually expressed as the pseudorapidity, 
T] = — ln[tan(6'/2)]. The azimuthal angle is measured 
with respect to the positive x direction, which points 
away from the center of the Tevatron ring. 

The inner tracking detectors are responsible for mea- 
suring the trajectories and momenta of charged particles 
and for locating track vertices. They reside inside a su- 
perconducting solenoid that generates a magnetic field 
of 2 T. A silicon niicrostrip tracker (SMT) is innermost 
and provides precision position measurements, particu- 
larly in the azimuthal plane, which allow the reconstruc- 
tion of displaced secondary vertices from the decay of 
long-lived particles. This permits identification of jets 
from heavy flavor quarks, particularly h quarks. A central 
fiber tracker is composed of scintillating fibers mounted 
on eight concentric support cylinders. Each cylinder sup- 
ports one axial and one stereo layer of fibers, alternat- 
ing by ±3° relative to the cylinder axis. The outermost 
cylinder provides coverage for \r]\ < 1.7. 

The calorimeter measures electron and jet energies, di- 
rections, and shower shapes relevant for particle identi- 
fication. Neutrinos are also measured via the calorime- 
ters' hermeticity and the constraint of momentum con- 
servation in the plane transverse to the beam direction. 
Three liquid-argon-filled cryostats containing primarily 
uranium absorbers constitute the central and endcap 
calorimeter systems. The former covers \r\\ < 1.1, and 
the latter extends coverage to \ri\ = 4.2. Each calorime- 
ter consists of an electromagnetic (EM) section followed 
longitudinally by hadronic sections. Readout cells are ar- 
ranged in a pseudo-projective geometry with respect to 
the nominal interaction region. 

Drift tubes and scintillators are arranged in planes out- 
side the calorimeter system to measure the trajectories 
of penetrating muons. One drift tube layer resides in- 
side iron toroids with a magnetic field of 1.8 T, while 
two more layers are located outside. The coverage of the 
muon system is jryj < 2. 

B. Data Sample 

The DO trigger and data acquisition systems are de- 
signed to accommodate instantaneous luminosities up to 
3 X lO'^^ cm~^s~"'^. The Tevatron operates with 396 ns 
spacing between proton (antiproton) bunches and deliv- 



ers a 2 MHz bunch crossing rate. For our data sample, 
each crossing yields on average 1.2 pp interactions. 

Luminosity measurement at DO is based on the rate 
of inelastic pp collisions observed by plastic scintillation 
counters mounted on the inner faces of the calorimeter 
endcap cryostats. Based on information from the track- 
ing, calorimeter, and muon systems, the first level of the 
trigger limits the rate for accepted events to 2 kHz. This 
is a dedicated hardware trigger. Second and third level 
triggers employ algorithms running in processors to re- 
duce the output rate to about 100 Hz, which is written 
to tape. 

Several different triggers are used for the five decay 
channels considered in this measurement. We employ sin- 
gle electron triggers for the ee and e-t-track channels and 
single muon triggers for the ^/i and n+track channels. 
The e/i analysis employs all unprescaled triggers requir- 
ing one electron and/or one muon. We also use triggers 
requiring one lepton plus one jet for the £-|-track channels. 
A slight difference between the vWT and MWT analyses 
occurs because the latter excludes 2% of data collected 
while the single muon trigger was prescaled. While the 
effect on the kinematic distributions is negligible, this re- 
sults in one less /i/i candidate event in the final sample 
for the MWT analysis. 

Events were collected with these triggers at DO be- 
tween April 2002 and February 2006 with = 1.96 TeV. 
Data quality requirements remove events for which the 
tracker, calorimeter, or muon system are known to be 
functioning improperly. The integrated luminosity of the 
analyzed data sample is about 1 fb^^. 



C. Particle Identification 

We reconstruct the recorded data to identify and mea- 
sure final state particles, as described below. A more 
detailed description can be found in Ref. ,17]. 

The primary event vertex (PV) is identified as the ver- 
tex with the lowest probability to come from a soft pp 
interaction based on the transverse momenta of associ- 
ated tracks. We select events in which the PV is re- 
constructed from at least three tracks and with \zpv\ < 
60 cm. Secondary vertices from the decay of long-lived 
particles from the hard interaction are reconstructed 
from two or more tracks satisfying the requirements of 
Pt > 1 GeV and more than one hit in the SMT. We re- 
quire each track to have a large impact parameter signif- 
icance, DCA/ctdca > 3.5, with respect to the PV, where 
DCA is the distance of the track's closest approach to 
the PV in the transverse plane. 

High-pT muons are identified by matching tracks in the 
inner tracker with those in the muon system. The track 
requirements include a cut on DCA< 0.02 (0.2) cm for 
tracks with (without) SMT hits. Muons are isolated in 
the tracker when the sum of track momenta in a cone 
of radius A7?,(muon, track) = y/ (A??)^ + (^0)^ = 0.5 
around the muon's matching track is small compared to 
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the track pT- We also require isolated muons to have the 
sum of calorimeter cell energies in an annulus with radius 
in the range 0.1 < ATZ < 0.4 around the matched track 
to be low compared to the matching track pT- 

High-pT isolated tracks are identified solely in the in- 
ner tracker. We require them to satisfy track isola- 
tion requirements and to be separated from calorimeter 
jets by A7?.(jet, track) > 0.5. These tracks must corre- 
spond to leptons from the PV, so we also require that 
DCA/ctdca < 2.5. We avoid double-counting leptons by 
requiring A7?.(track, > 0.5. 

Electrons are identified in the EM calorimeter. Cells 
are clustered according to a cone algorithm within ATZ < 
0.2 and then matched with an inner detector track. Elec- 
tron candidates are required to deposit 90% of their en- 
ergy in the EM section of the calorimeter. They must 
also satisfy an initial selection which includes a shower 
shape test (Xhmx) "^ith respect to the expected elec- 
tron shower shape, and a calorimeter isolation require- 
ment summing calorimeter energy within ATZ < 0.4 but 
excluding the cluster energy. To further remove back- 
grounds, a likelihood (£e) selection is determined based 
on seven tracking and calorimeter parameters, including 
Xhmx' DCA, and track isolation calculated in an annulus 
of 0.05 < ATZ < 0.4 around the electron. The final elec- 
tron energy calibration is determined by comparing the 
invariant mass of high pT electron pairs in Z/"f* — > e~^e~ 
events with the world average value of the Z boson mass 
as measured by the LEP experiments [3| . 

In ti events, the leptons and tracks originate from the 
hard interaction. Therefore, we require their z positions 
at the closest approach to the beam axis to match that 
of the PV within 1 cm. 

We reconstruct jets using a fixed cone algorithm [l^ 
with radius of 0.5. The four-momentum of a jet is 
measured as the sum of the four-momenta assigned to 
calorimeter cells inside of this cone. We select jets that 
have a longitudinal shower profile consistent with that of 
a collection of charged and neutral hadrons. We confirm 
jets via the electronically independent calorimeter trig- 
ger readout chain. Jets from b quarks are tagged using 
a neural network b jet tagging algorithm [l^. This com- 
bines the impact parameters for all tracks in a jet, as well 
as information about reconstructed secondary vertices in 
the jet. We obtain a typical efficiency of 54% for b jets 
with |?7| < 2.4 and pr > 30 GeV for a selection which 
accepts only 1% of light flavor (u, d, s quark or gluon) 
jets. 

Because the b jets carry away much of the rest energy 
of the top quarks, it is critical for the measurement of 
nit that the measurements of the energies of jets from 
top quark decay be well calibrated. Jet energies deter- 
mined from the initial cell energies do not correspond to 
the energies of final state particles striking the calorime- 
ter. As a result, a detailed calibration is apphed 0, Hlj 
in data and Monte Carlo separately. In general, the en- 
ergy of all final state particles inside the jet cone, Ej*''^^, 
can be related to the energy measured inside the jet cone. 



Ej, by i;f' ' = {Ej -0)/{RS). Here, O denotes an off- 
set energy primarily from extra interactions in or out of 
time with an event. R is the cumulative response of the 
calorimeter to all of the particles in a jet. S* is the net 
energy loss due to showering out of the jet cone. For 
a given cone radius, O and S are functions of the jet 77 
within the detector. O is also a function of the number of 
reconstructed event vertices and the instantaneous lumi- 
nosity. R is the largest correction and reflects the lower 
response of the calorimeters to charged hadrons relative 
to electrons and photons. It also includes the effect of 
energy losses in front of the calorimeter. The primary 
response correction is derived in situ from 7-l-jet events 
and has substantial dependences on jet energy and rj. 
For all jets that contain a non-isolated muon, we add the 
muon momenta to that of the jet. Under the assumption 
that these are b quark semileptonic decays, we also add 
an estimated average neutrino momentum assumed to be 
collinear with the jet direction. The correction procedure 
discussed above does not correct all the way back to the 
original b quark parton energy. 

The event missing transverse energy, ^t, is equal 
in magnitude and opposite in direction to the vector 
sum of all signiflcant transverse energies measured by 
the individual calorimeter cells. It is corrected for the 
transverse momenta of all isolated muons, as well as for 
the corrections to the electron and jet energies. In the 
^+track channels, the is also corrected if the track 
does not point to a jet, electron, or muon. In this case, 
we substitute the track pT for the calorimeter energy 
within a cone of radius ATZ — 0.4 around the track. 

D. Signal and Background Simulation 

An accurate description of the composition and kine- 
matic properties of the selected data sample is essential 
to the mass measurement. Monte Carlo samples for the tt 
processes are generated for several test values of the top 
quark mass. The event generation, fragmentation, and 
decay are performed by pythia 6.319 [22|. Background 
processes are called "physics" backgrounds when charged 
leptons originate from W or Z boson decay and when 
l^T comes from high pT neutrinos. Physics backgrounds 
include Z/^* tt with t ^ e^fi and diboson [WW, 
WZ, and ZZ) production. The Z/7* tt background 
processes are generated with ALPGEN 2.11 [23| as the 
event generator and pythia for fragmentation and decay. 
We decay hadrons with b quarks using evtgen To 
avoid double counting QCD radiation between ALPGEN 
and PYTHIA, the jet-parton matching scheme of Ref. [2^ 
is employed in ALPGEN. The diboson backgrounds are 
simulated with pythia. We use the CTEQ6L1 [26l| par- 
ton distribution function (PDF). Monte Carlo events are 
then processed through a GEANT-based 27[ simulation 
of the DO detector. In order to accurately model the ef- 
fects of multiple proton interactions and detector noise, 
data events from random pp crossings are overlaid on the 
Monte Carlo events. Finally, Monte Carlo events are pro- 
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cessed through the same reconstruction software as used 
for data. 

In order to ensure that reconstructed objects in these 
samples reflect the performance of the detector in data, 
several corrections are applied. Monte Carlo events are 
reweighted by the z coordinate of the PV to match the 
profile in data. The Monte Carlo events are further tuned 
such that the efficiencies to find leptons, isolated tracks, 
and jets in Monte Carlo events match those determined 
from data. These corrections depend on the and 77 
of these objects. The jet energy calibration derived for 
data is applied to jets in data, and the jet energy cali- 
bration derived for simulated events is applied to simu- 
lated events. We observe a residual discrepancy between 
jet energies in Z-|-jets events in data and Monte Carlo. 
We apply an additional correction to jet energies in the 
Monte Carlo to bring them into agreement with the data. 
This adjustment is then propagated into the ^t- We ap- 
ply additional smearing to the reconstructed jet and lep- 
ton transverse momenta so that the object resolutions in 
Monte Carlo match those in data. Owing to differences 
in 6-tagging efficiency between data and simulation, h- 
tagging in Monte Carlo events is modeled by assigning 
to each simulated event a weight defined as 

P = l- [][l-p,(r;, pT, flavor)], (1) 

i=l 

where (77, pt, flavor) is the probability of the i**^ jet 
to be identified as originating from a h quark, obtained 
from data measurements. This product is taken over 
all jets. Instrumental backgrounds are modeled from a 
combination of data and simulation and are discussed in 
Section Unci 

III. SELECTED EVENT SAMPLE 

Events are selected for all channels by requiring 
either two leptons (2£) or a lepton and an isolated track 
(£-|-track), each with -pT > 15 GeV. Electrons must be 
within I77I < 1.1 or 1.5 < \r]\ < 2.5; muons and tracks 
should have jvyj < 2.0. An opposite charge requirement 
is applied to the two leptons or to the lepton and track. 
At least two jets are also required with pseudorapidity 
|?7| < 2.5 and px > 20 GeV. We require the leading 
jet to have px > 30 GeV. Since neutrinos coming from 
W boson decays in tt events are a source of significant 
missing energy, a cut on is a powerful discriminant 
against background processes without neutrinos such as 
Z/7* — > ee and Z/j* —^ 1j.f1. All channels except e/i 
require at least > 25 GeV. 

A. 2£ Selection 

Our selection of 2£ events follows Ref. [2^. In the ee 
channel, events with a dielectron invariant mass M^e < 
15 GeV or 84 < Mee < 100 GeV are rejected. We require 
> 35 GeV and > 45 GeV when M^e > 100 GeV 



and 15 < M^e < 84 GeV, respectively. In the /i/i channel, 
we select events with M^^ > 30 GeV and > 40 GeV. 
To further reject the Z/j* jifj, background in the fj,fi 
channel, we require that the observed be inconsistent 
with arising solely from the resolutions of the measured 
muon momenta and jet energies. 

In the e/i analysis, no cut on is applied because the 
main background process Z/j* — > tt generates four neu- 
trinos having moderate pt- Instead, the final selection in 
this channel requires = pfji + J^i-^x) > US GeV, 
where pjl denotes the transverse momentum of the lead- 
ing lepton, and the sum is performed over the two leading 
jets. This requirement rejects the largest backgrounds for 
this final state, Z/j* — tt and diboson production. We 
require the leading jet to have px > 40 GeV. 

The selection described above is derived from that used 
for the tt cross-section analysis. Varying the and jet 
Pt selections indicated that this selection minimizes the 
statistical uncertainty on the mt measurement. We select 
17 events in the ee channel and 13 events (12 events for 
MWT) in the /i/i channel. We select 39 events in the e/i 
channel. 



B. £~f track Selection 

The selection for the ^-|-track channels is similar to 
that of Ref. [l3]. For the e-|-track channel, electrons are 
restricted to \r]\ < 1.1, and the leading jet must have 
Pt > 40 GeV. The dominant ^-|-track background arises 
from Z ee and Z fifi production, so we design the 
event selection to reject these events. 

When the invariant mass of the lepton-track pair (Ai^t) 
is in the range 70 <Mit< 110 GeV, the requirement is 
tightened to > 35 (40) GeV for the e-t-track (/t-|-track) 
channel. Furthermore, we introduce the variable 
that corrects the I^t m Z ^ £i events for mismeasured 
lepton momenta. We rescale the lepton and track mo- 
menta according to their resolutions to bring Mgt to the 
mass of the Z boson (91.2 GeV) and then use these 
rescaled momenta to correct the ^t- Event selection 
based on this variable reduces the Z background by half 
while providing 96% efficiency for tt events. The cuts on 
are always identical to those on ^t- 

At least one jet is required to be identified as a 6 
jet which provides strong background rejection for the 
^-|-track channels. The mt precision is limited by signal 
statistics in the observed event sample when the back- 
ground is reasonably low. The above selection is a result 
of an optimization which minimizes the statistical uncer- 
tainty on rrit- We do this in terms of ^t, -^t'^', the 
transverse momenta of the leading two jets, and the b- 
tagging criteria by stepping through two or more different 
thresholds on these requirements. After considering all 
possible sets of selections, we choose the one which gives 
the best average expected statistical uncertainty on the 
mt measurement using many pseudoexperiments. The 
expected statistical uncertainty varies smoothly over a 



8 



15% range while the study is sensitive to 5% changes of 
the average statistical uncertainty. 

We explicitly veto events satisfying the selection of any 
of the H channels, so the ^+track channels are statisti- 
cally independent of the 21 channels. We select eight 
events in the e+track channel and six events in the 
/i+track channel. 



C. Modeling Instrumental Backgrounds 

Backgrounds can arise from instrumental effects in 
which the J^t is mismeasured. The main instrumental 
backgrounds for the ee, /i/i, e+track, and /i+track chan- 
nels are the Z j^* — > ee and Z j^* — > /i/i processes. In 
these cases, apparent Jfi^ results from tails in jet or lep- 
ton resolutions. We use the NNLO cross section for 
Z/7* ee,/i/i processes, along with the Monte Carlo- 
derived efficiencies to estimate these backgrounds for the 
ee and ///i channels. The Monte Carlo kinematic dis- 
tributions, including the -^t, are verified to reproduce 
a data sample dominated by these processes. For the 
^+track channels, we normalize Drell-Yan Monte Carlo 
so that the total expected event yield in a ^+track sam- 
ple with low JfiT equals the observed event yield in the 
data. We observe a slightly different distribution for 
simulated Z ^ II events in comparison with data. As 
a result, all Z boson simulated samples, including the 
Z ^ TT physics background samples, are reweighted to 
the observed distribution of in data [2§| . 

Another background arises when a lepton or a track 
within a jet is identified as an isolated lepton or track. We 
utilize different methods purely in data to estimate the 
level of these backgrounds for each channel. In all cases, 
however, we distinguish reconstructed muons and tracks 
as "loose" rather than "tight" by releasing the isolation 
criteria. We make an analogous distinction for electrons 
by omitting the requirement on the electron likelihood, 
£e, for "loose" electrons. 

To determine the misidentified electron background 
yield in the ee and e/x channels, we fit the observed distri- 
bution of in the data to a sum of the distributions from 
real isolated electrons and misidentified electrons. We de- 
termine the shape of /!e for real electrons from a Z — > ee 
sample wither < 15 GeV. For the ee channel, we extract 
the shape for the misidentified electrons from a sample in 
which one "tag electron" is required to have both Xhmx 
and >Ce inconsistent with being from an electron. We fur- 
ther require M^e < 60 GeV or Mee > 130 GeV and 
< 15 GeV to reject Z and W boson events. The distribu- 
tion of £e is obtained from a separate "probe electron" in 
the same events. In the e/i channel, the i2e distribution 
for misidentified electrons is obtained in a sample with a 
non-isolated muon and Ifir < 15 GeV. 

To estimate the background from non-isolated muons 
for the e/i and /x/i channels, we use control samples to 
measure the fraction of muons, with > 15 GeV 
that appear to be isolated. To enhance the heavy flavor 



content which gives non- isolated muons, the control sam- 
ples are selected to have two muons where a "tag" muon 
is required to be non-isolated. We use another "probe" 
muon to determine The background yield for the e/i 
channel is computed from the number of events having 
an isolated electron, a muon with no isolation require- 
ment, and the same sign charge for the two leptons. We 
multiply the observed yield by 

We estimate the instrumental background for the /i/i 
and £+track channels by using systems of linear equa- 
tions describing the composition of data samples with dif- 
ferent "loose" or "tight" lepton and/or track selections. 
We relate event counts in these samples to the numbers 
of events with real or misidentified isolated leptons using 
the system of equations. These equations take as inputs 
the efficiencies for real or misidentified leptons and tracks 
to pass the tight identification requirements. For the /i/i 
and £+track channels, we determine the efficiencies for 
real leptons and tracks to pass the tight identification 
criteria using Z ee and Z — > /i/i events. 

For the ^+track channels, the probabilities for misiden- 
tified leptons and tracks to pass the tight selection cri- 
teria are determined from multijet data samples with at 
least one loose lepton plus a jet. We reject the event if two 
leptons of the same flavor satisfy tight criteria to suppress 
Drell-Yan events. We also reject events with one or more 
tight leptons with different flavor from the loose lepton. 
These tight lepton vetos allow some events with two loose 
leptons or a lepton and track in the sample. We further 
suppress resonant Z production by selecting events when 
Mft and M« > 100 GeV or and M« < 70 GeV. We 
reject M^+jets events and misreconstructed Zj^* events 
by requiring < 15 GeV and ^^^^< 25 GeV. Here, 
^j^^ is the missing transverse energy with only jet en- 
ergy corrections and no lepton corrections. We use the 
latter because loose leptons no longer adhere to standard 
resolutions. We calculate the probability for electrons 
or muons to be misidentifled by dividing the number of 
tight leptons by the number of loose leptons. For the 
track probability, we combine the e+jet and fi+jet sam- 
ples and make the additional requirement that there be 
at least one loose track in the event. The tight track 
misidentiflcation probability is again the number of tight 
tracks divided by the number of loose tracks. 

To obtain samples dominated by misidentified isolated 
leptons for mass analysis, we select events with two loose 
leptons or tracks plus two jets. For the 2i channels, we 
additionally require same sign dilepton events. 

D. Composition of Selected Samples 

The expected numbers of background and signal events 
in all five channels (assuming a top quark production 
cross section of 7.0 pb) are listed in Table[T]along with the 
observed numbers of candidates. The /i+track selection 
has half the efficiency of the e+track selection primarily 
due to the tight /i/i veto. The expected and observed 
event yields agree for all channels. 
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TABLE L Expected event yields for ti (we assume au — 7.0 pb) and backgrounds and numbers of observed events for the five 
cliannels. Tlie 2£ diannel uncertainties include statistical as well as systematical uncertainties while the e+track and /i+track 
uncertainties are statistical only. 

Sample tt Diboson Z Multijet/W+jets Total Observed 

'ejl 36.7 ± 2.4 1.7 ± 0.7 4.5 ± 0.7 2.6 ± 0.6 44.5 ± 2.7 39 

ee 11.5 ± 1.4 0.5 ± 0.2 2.3 ± 0.4 0.6 ± 0.2 14.8 ± 1.5 17 

H/i 8.3 ± 0.5 0.7 ± 0.1 4.5 ± 0.4 0.2 ± 0.2 13.7 ± 0.7 13 

e+track 9.4 ± 0.1 0.1 ± 0.0 0.4 ± 0.1 0.4 ± 0.1 10.3 ± 0.2 8 

/i+track 4.6 ± 0.1 0.1 ± 0.0 0.7 ± 0.1 0.1 ± 0.0 5.5 ± 0.1 6 
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FIG. 1: Comparison of the expected distributions from backgrounds and tt {rrit — 170 GeV) in the ^+track channels, (a) Met 
for the e+track channel without the requirement of the 6-tag and with inverted cuts, (b) for the sum of both i?+track 
channels, again without the 6-tag requirement. We assume au = 7.0 pb. 



Kinematic comparisons between data and the sum of 
the signal and background expectations provide checks 
of the content and properties of our data sample. Fig- 
ure [ija) shows the expected and observed distributions 
of Met in the e+track channel without the 6-tag require- 
ment and for an inverted requirement. The /i+track 
distribution looks similar (not shown). The mass peak 
at Mz indicates the e+track sample is primarily com- 
posed of Z ee events before the final event selection. 
In Fig. [Hb), we show the distribution in the ^+track 
channels after all cuts except the &-tag requirement. 



The expected numbers of background and signal events 
after all selections in all five channels are listed in Ta- 
ble |T] along with the observed numbers of candidates. We 
assume a{tt) — 7.0 pb. We do not include systematic 
uncertainties for the £+track channels. The small back- 
grounds mean their uncertainties have a negligible effect 
on the measured mt uncertainty. The /t+track selection 
has half the efficiency of the e+track selection primarily 
due to the tight fifi veto. The expected and observed 
event yields agree for all channels. Figures [U^a) and (b) 
show the and leading lepton px summed over all chan- 
nels for the final candidate sample. We observe the data 
distributions to agree with our signal and background 
model. 



IV. EVENT RECONSTRUCTION 

Measurement of the dilepton event kinematics and con- 
straints from the tt decay assumption allow a partial re- 
construction of the final state and a determination of mt . 
Given the decay of each top quark to a boson and a 
b quark, with each W boson decaying to a charged lep- 
ton and a neutrino, there are six final state particles: 
two charged leptons, two neutrinos, and two b quarks. 
Each particle can be described by three momentum com- 
ponents. Of these eighteen independent parameters, we 
can directly measure only the momenta of the leptons. 
The leading two jets most often come from the b quarks. 
Despite final state radiation and fragmentation, the jet 
momenta are highly correlated with those of the underly- 
ing b quarks. We also measure the x and y components of 
the ^T, and ^y, from the neutrinos. This leaves four 
quantities unknown. We can supply two constraints by 
relating the four-momenta of the leptons and neutrinos 
to the masses of the W bosons: 

M^ = {E,,+Ei,f -{p,,+pi,f, 

where the subscript indices indicate the pair coming 
from one or another W boson. Another constraint is 
supplied by requiring that the mass of the top quark and 
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FIG. 2: (a) and (b) leading lepton pr for tt (rat ~ 170 GeV) and background processes overlaid with those for observed 
events in all channels after final event selection. We assume att ~ 7.0 pb. 



the mass of the anti-top quark be equal: 

(E^, + El, + Eb, f - {p^, +pi, +pb,f = 
(E^^ + Ei^ + Eb^f - {p^^ +pi^ +Pb2f- 



(3) 



The last missing constraint can be supphed by a hypoth- 
esized value of the top quark mass. With that, we can 
solve the equations and calculate the unmeasured top 
quark and neutrino momenta that are consistent with 
the observed event. Usually, the dilepton events are kine- 
matically consistent with a large range of mf. We quan- 
tify this consistency, or "weight," for each mt by test- 
ing measured quantities of the event (e.g., or lepton 
and jet pr) against expectations from the dynamics of ti 
production and decay. This requires us to sample from 
relevant tt distributions, yielding many solutions for a 
specific mt- We sum the weights for each solution for 
each TTif. The distribution of weight vs. mt is termed a 
"weight distribution" of a given event. Using parameters 
from these weight distributions, we can then determine 
the most likely value of mt . 

Several previous efforts to measure mt using dilepton 
events have used event reconstruction techniques. The 
differences between methods stem largely from which 
event parameters are used to calculate the event weight. 
We use the z/WT and MWT techniques to determine 
the weighting as described below. 

A. Neutrino Weighting 

The i^WT method omits the measured for kine- 
matic reconstruction. Instead, we choose the pseudora- 
pidities of the two neutrinos from ti decay from their ex- 
pected distributions. We obtain the distribution of neu- 
trino 7] from several simulated ti samples with a range of 
mt values. These distributions can each be approximated 
by a single Gaussian function. The standard deviation 
specifying this function varies weakly with mt- Once the 
neutrino pseudorapidities are fixed and a value for mt as- 
sumed, we can solve for the complete decay kinematics, 
including the unknown neutrino momenta. There may 



be up to four different combinations of solved neutrino 
momenta for each assumed pair of neutrino rj values for 
each event. We assume the leading two jets are the h jets, 
so there are two possible associations of W bosons with 
h jets. 

For each pairing of neutrino momentum solutions, we 
define a weight, w, based on the agreement between the 
measured and the sum of the neutrino momentum 
components in x and y, p'^ and Py. We assume indepen- 
dent Gaussian resolutions in measuring arrd . The 
weight is calculated as 



exp 



calc 



exp 



_^^calc_ ^obs^2 



(4) 

reflecting the agreement between the measured and cal- 
culated ^T- ^f^^ {i = X or y) are the components of 
the measured event , and ^f^^'^ are the components of 
the calculated from the neutrino transverse momenta 
resulting from each solution. We calculate the quantities 
to be the sums of the energies projected onto the i 
axes measured by all "unclustered" calorimeter cells - 
those cells not included in jets or electrons. The high px 
objects, leptons and jets, enter into the determination of 
both ^1^'"^ and whereas the unclustered energy 
only enters into Given the resolutions cr" of the 

we can therefore estimate the probability that the 
are consistent with the ^^'^''^ from the ti hypothesis. 

As parameters of the method, we determine cr" using 
Z ^ ee + 2 jets data and Monte Carlo events. We calcu- 
late an unclustered scalar transverse energy, S^, as the 
scalar sum of the transverse energies of all unclustered 
calorimeter cells. Due to the azimuthal isotropy of the 
calorimeter, we observe that the independent x and y 
components of the af depend on S"^ in the same way 
within their uncertainties. Therefore, we combine results 
for both components to determine our resolution more 
precisely. We find agreement between data and simula- 
tion in the observed dependence of these parameters on 
S"^. The distributions are shown for these combined res- 
olutions in Fig. [31 We fit the unclustered resolutions 
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FIG. 3: Dependence of the resolution of unclustered on 
the unclustered scalar transverse missing energy for Z — > ee 
events with exactly two jets. 



obtained from simulation as 

a^{S^) = a^{S^) 4.38 GeV + 0.52^5^ GeV, (5) 

and use this parametrization for the unclustered miss- 
ing energy resolution for both data and Monte Carlo in 
Eq. Q. 

For each event, we consider ten different rj assump- 
tions for each of the two neutrinos. We extract these 
values from the histograms appropriate to the mt being 
assumed. The ten rj values are the medians of each of ten 
ranges of rj which each represent 10% of the ti sample for 
a given mt- 



B. Matrix Weighting 

In the MWT approach, we use the measured momenta 
of the two charged leptons. We assign the measured mo- 
menta of the two jets with the highest transverse mo- 
menta to the b and b quarks and the measured to the 
sum of the transverse momenta of the two neutrinos from 
the decay of the t and t quarks. We then assume a top 
quark mass and determine the momenta of the t and t 
quarks that are consistent with these measurements. We 
refer to each such pair of momenta as a solution for the 
event. For each event, there can be up to four solutions. 

We assign a weight to each solution, analogous to the 
i^WT weight of Eq. 21 given by 



w = f{x)f{x)p{El\mt)p{El\mt), 



(6) 



where f{x) is the PDF for the proton for the momentum 
fraction x carried by the initial quark, and f{x) is the cor- 
responding value for the initial antiquark. The quantity 
El is the observed lepton energy in the top quark rest 
frame. We use the central fit of the CTEQ6L1 PDFs 
and evaluate them at = ml. The quantity p{E1\mt) 
in Eq. [5] is the probability that for the hypothesized top 



quark mass mt, the lepton I has the measured E'l [llj: 

4mtE*{m'^ -ml- 2mtE*) 

I^{m^,-ml)-2M^- 

(7) 



p{Eg\mt) 



{ml - mlY + M^{mi - ml) 



C. Total Weight vs. mt 

Equations [4] and [6] indicate how the event weight is 
calculated for a given top quark mass in the i^WT and 
MWT methods. In each method, we consider all solu- 
tions and jet assignments to get a total weight, Wtot, for 
a given mt. In general, there are two ways to assign the 
two jets to the b and b quarks. There are up to four 
solutions for each hypothesized value of the top quark 
mass. The likelihood for each assumed top quark mass 
mt is then given by the sum of the weights over all the 
possible solutions: 



Wtot 



(8) 



where j sums over the solutions for each jet assignment i. 
We repeat this calculation for both the j^WT and MWT 
methods for a range of assumed top quark masses from 
80 GeV through 330 GeV. 

For each method, we also account for the finite resolu- 
tion of jet and lepton momentum measurements. We re- 
peat the weight calculation with input values for the mea- 
sured momenta (or inverse momenta for muons) drawn 
from normal distributions centered on the measured val- 
ues with widths equal to the known detector resolutions. 
We then average the weight distributions obtained from 
N such variations: 



Wtot{mt) = N' 



N 

E 

n=l 



Xmt). 



(9) 



where N is the number of samples. One important bene- 
fit of this procedure is that the efficiency of signal events 
to provide solutions increases. For instance, the vWT 
efficiency to find a solution for tt^ efi events is 95.9% 
without resolution sampling, while 99.5% provide solu- 
tions when N = 150. For the MWT analysis, events with 
mt — 175 GeV yield an efficiency of 90% without resolu- 
tion sampling. This rises to over 99% when N — 500. We 
use N = 150 and 500 for vWT and MWT, respectively. 

Examples of a single event weight distribution for vWT 
and MWT are shown in Fig. |4] for two different simu- 
lated events. The sensitivities of the two methods are 
similar on average, but the different widths of the weight 
distributions vary significantly in both approaches on an 
event-by-event basis. This can be caused by an over- 
all insensitivity of an event's kinematic quantities to mt, 
or to a different sensitivity when using those kinematic 
quantities with specific event reconstruction techniques. 

Properties of the weight distribution are strongly cor- 
related with mt if the top quark decay is as expected in 
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FIG. 4: Example weight distributions for different single ti 
efi Monte Carlo events obtained with a) i^WT method and b) 
MWT method. The generator level mass is rrit = 170 GeV. 



the standard model. For instance, Fig.[5]^a) illustrates the 
correlation of the mean of the i^WT weight distribution, 
fiu], with the generated top quark mass from the Monte 
Carlo. The relationship between the root-mean-square of 
the weight distribution, a^, and fj.^ also varies with mt, 
as shown in Fig. E^b). There is the potential for non- 
standard decays of the top quark. For mt = 170 GeV 
and assuming BR{H'^ tv) ^ 100%, we observe ^jl^ 
(I'WT) to shift systematically upward when a boson 
of mass 80 GeV is present in the decay chain instead of 
a W boson. When BR(t ^ HH) =100%, this shift is 
10%. Thus, the measurements of this paper are strictly 
valid only for standard model top quark decays. 



V. EXTRACTING THE TOP QUARK MASS 

We cannot determine the top quark mass directly from 
or from the most probable mass from the event weight 
distributions, max^,. Effects such as initial and final 
state radiation systematically shift these quantities from 
the actual top quark mass. In addition, the presence of 
background must be taken into account when evaluating 
events in the candidate sample. We therefore perform 
a maximum likelihood fit to distributions ("templates") 
of characteristic variables from the weight distributions. 
This fit accounts for the shapes of signal templates and 



for the presence of background. 

A. Measurement Using Templates 

We define a set of input variables characterizing the 
weight distribution for event i, denoted by {xi}^, where 
N is the number of variables. Examples of {xijjv might 
be the integrated weight in bins of a coarsely binned tem- 
plate, or they might be the moments of the weight dis- 
tribution. A probability density histogram for simulated 
signal events, /i^, is defined as an (TV -I- l)-dimensional 
histogram of input top quark mass vs. N variables. For 
background, hb is defined as an A'^-dimensional histogram 
of the {xi\N- Both and hb are normalized to unity: 



hs{{xi}N I mt)d{xi}N 



hb[{Xi\N)d{Xi}N = 1- 



(10) 
(11) 



An example of a template for the MWT method is shown 
on Fig. [5] We measure mt from hs{{xi}jsf,mt) and 
hbi{xi}N) using a maximum likelihood method. For each 
event in a given data sample, all {xi}N are found and 
used for the likelihood calculation. We define a likeli- 
hood C as 



C{{xi}N,nb,Nohs I mt) = 
nshs{{xi}N I mt) + nbhb{{xi}N) 



n 



(12) 



rib 



where A^obs is the number of events in the sample, Ub is 
the number of background events, and is the signal 
event yield. We obtain a histogram of —InC vs. mt for 
the sample. We fit a parabola that is symmetric around 
the point with the highest likelihood (lowest —InC). The 
fitted mass range is several times larger than the expected 
statistical uncertainty. It is chosen a priori to give the 
best sensitivity to the top quark mass using Monte Garlo 
pseudoexperiments, and is typically around ±20 GeV. 

We obtain measurements of mt for several channels by 
multiplying the likelihoods of these channels: 



- In/: = ^(- lurch) 



(13) 



ch 



where "ch" denotes the set of channels. In this pa- 
per, we calculate overall likelihoods for the 2£ subset, 
ch e {e/z, ee, ^/x}; the £-f track subset, ch G {e-|-track, 
/i-|-track}; and the five channel dilepton set, ch G 
{e/i, ee, /i/i, e-l-track, /u-l-track}. 



B. Choice of Template Variables 

The choice of variables characterizing the weight dis- 
tributions has been given some consideration in the past. 
For example, the DO MWT analysis and CDF j/WT anal- 
yses have used max^, [U, [11] . Earlier DO j^WT analyses 
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FIG. 5: (a) Correlation between the mean of the vWT weight distribution and the input rnt- (b) Correlation between i^WT 
fj,w and (Ju, for the efi channel. Three test masses of 155 GeV, 180 GeV, and 200 GeV are shown. 
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FIG. 6: An example of a template for the MWT method. 




employed a multiparameter probability density technique 
using the coarsel y b inned weight distribution to extract 
a measure of rrit [H, [l3| 

For the MWT analysis described here, we use the single 
parameter approach. In particular, to extract the mass, 
we use Eg . [T2l where Xi = {ma.Xt^}. We determine the 
values of and Ub by scaling the sum of the expected 
numbers of signal (fis) and background {fib) events in 
Table [T] to the number observed in each channel. We fit 
the histogram of — In £ to a parabola using a 40 GeV 
wide mass range centered at the top quark mass with 
the minimal value of — InC for all channels. 

For the lyWT analysis, we define the optimal set of 
input variables for the top quark mass extraction to be 
the set that simultaneously minimizes the expected sta- 
tistical uncertainty and the number of variables. The 
coarsely binned weight distribution approach exhibits up 
to 20% better statistical performance than single param- 
eter methods for a given kinematic reconstruction ap- 
proach by using more information. However, over five 
bins are typically needed, and the large number of vari- 
ables and their correlations significantly complicate the 
analysis. We study the performance of the method with 
many different choices of variables from the weight dis- 
tributions. These include single parameter choices such 



as maxm or fiu, , which provide similar performance in the 
range 140 GeV < nit < 200 GeV. 

Vectors of multiple parameters included various 
coarsely binned templates, or subsets of their bins. For 
the lyWT analysis, we observe that individual event 
weight distributions have fiuctuations which are reduced 
by considering bulk properties such as their moments. 
The most efficient parameters are the first two moments 
(pw and CTu,) of the weight distribution. This gives 16% 
smaller expected statistical uncertainty than using max^u 
or fi-uj alone. The improvement of the performance comes 
from the fact that (7^ is correlated with fj.^ for a given 
input top quark mass. This is shown in Fig. [5fb) for 
three different input top quark masses. The value of cr^ 
helps to better identify the range of input mt that is 
most consistent with the given event having a specific 
fiw This ability to de- weight incorrect mt assignments 
results in a narrower likelihood distribution and causes 
a corresponding reduction in the statistical uncertainty. 
No other choice of variables gives significantly better per- 
formance. The use of the weight distribution moments, 
Xi = {/iiu, tJu,}, in hg and hb with Eq. [T^lis termed i^WT/j. 

Because the templates are two-dimensional for back- 
ground and three-dimensional for signal, a small number 
of bins are unpopulated. We employ a constant extrap- 
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olation for unpopulated edge bins using the value of the 
populated bin closest in rrit but having the same /iu, and 
aw For empty bins flanked by populated bins in the mt 
direction but with the same /i^j and cFw, we employ a 
linear interpolation. We fit the histogram of — ln£ vs. 
nit with a parabola. When performing the fit, the vWTh 
approach determines Ug to be Hs — Nohs — nt- Fit ranges 
of 50, 40, and 30 GeV are used for £+track, 2i, and all 
dilepton channels, respectively. 



C. Probability Density Functions 

In both methods described above, there are finite 
statistics in the simulated samples used to model hg and 
hh, leading to bin- by-bin fluctuations. We address this in 
the vWT analysis by performing fits to hg and hb tem- 
plates. We term this version of the i^WT method i^WT/. 
For the signal, we generate a probability density function 
fs by fitting with the functional form 

fs{fJ'w,crw \mt) = ((T„ +Pi3)^' exp[-p7(cr^ +^13)^**] 

(1 -Pg) — ^exp[ 1 



1+P12 
Pll I 
-P9 ■ ^TTT^— — ^(M 



(TV27r 

m 



2^2 



, NVf-- )Pi"exp[-pii(/i„ )] 

1(1+P12) PlO PlO 

m 

X e(/x„ ^ 

PlO 



(14) 

The parameters m and a are linear functions of (7^ and 
mt: 



m = po+PiicTw - 36 GeV) + P2{mt - 170 GeV), 
cr ^ p3+pi{(7^ - 36 GeV) +P5{mt - 170 GeV). 



(15) 



Equations [14] and [15] are ad hoc functions determined 
empirically. A typical with respect to hg is found in 
the e/i channel which yields 4.0 per degree of freedom. 
The linear relationship between and /i^ is shown in 
Fig. [TJJa), which is an example of the probability den- 
sity vs. (7^ and /i^ for fixed input top quark mass of 
170 GeV. The dependence of fs on is expressed in 
the first line of Eq. [TJ] The second and third lines con- 
tain a Gaussian plus an asymmetrical function to de- 
scribe the dependence on The factors l/((TV27r) 
and p]^^" /T(l + P12) and the integral in the fourth 
line normalize the probability density function to unity. 
Examples of two-dimensional slices of three-dimensional 
signal histograms for fixed input nit = 170 GeV and 
aw — 30 GeV are shown in Fig. [TJ^a) and (c), respec- 
tively. The corresponding slices of the fit functions are 
shown in Fig.[7]^b) and (d), respectively. 

The background probability density function 
fbifJ-iu, o'w) is obtained as the normalized two-dimensional 



function of fiu, and a^i of simulated background events: 

/6(Aiu), fto) = 

exp [-(pi^i„ -I- p2aw - po)^ - {P4.^J■w + P5<7w - Pa)^] 
/o°° 1(7 [-(Pi^ + P2y - Po)^ - {P4X + p^y - Ps)^] dx dy ' 

... 

This is also an ad hoc function determined empirically. 
The fit is performed to hb containing the sums of all back- 
grounds for each channel and according to their expected 
yields. A typical with respect to hb is found in the e/i 
case which yields 5.2 per degree of freedom. Examples of 
hb and fb are shown in Fig.[5Ka) and (b), respectively. 

To measure Wt, we begin by adding two extra terms to 
the likelihood C of Eq. [T^] The first term is a constraint 
that requires that the fitted sum of the number of signal 
events Ug and the number of background events Ub agrees 
within Poisson fluctuations with the number of observed 
events iVobs: 

[n.g + nb)^"^' exp[-(ns -|- «&)] 



1 

{Ug + Ub, Nohs) 



A^obs! 



(17) 

The second term is a Gaussian constraint that requires 
agreement between the fitted number of background 
events Ub and the number of expected background events 
fib within Gaussian fluctuations, where the width of the 
Gaussian is given by the estimated uncertainty 5b on fit,: 

3(^6, hb, 5b) = J- exp [-{nb - fibf /25l]. (18) 



L 



gauss V 



The total likelihood for an individual channel is given by 
C{fJ'wi,o-yji, fib, Nohs I mt,ng,nb) = 



Cga.nss{nb, Ub, 5b)Cp oissoii 

{ng + Ub, Nohs) 
N (19) 

wi, ^wi) 



^ yr ngfg{fiw^,awi \ mt) + Ubf bjfJ- 

^\ rig+nb 
i—i 



The product extends over all events in the data sam- 
ple. The maximum of the likelihood corresponds to 
the measured top quark mass. We simultaneously min- 
imize — InC with respect to mt, ng, and nb using 
MINUIT [33, HH . The fitted sample composition is con- 
sistent with the expected one. This yields an analytic 
function of — ln£ vs. mt- Its statistical uncertainty is 
found by fixing ng and nb to their optimal values and 
taking half of distance between the points at which the 
— ln£ value is 0.5 units greater than its minimum value. 
We obtain measurements of mt for several channels by 
minimizing the combined — In C simultaneously with re- 
spect to mt and the numbers of signal and background 
events for the channels considered. 



D. Pseudoexperiments and Calibration 

The maximum likelihood fits attempt to account for 
the presence of background and for the signal and back- 
ground shapes of the templates. For a precise measure- 
ment of mt , we must test for any residual effects that can 
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FIG. 7: Slices of probability density histograms hs and fit functions fs for the i^WT method in the e/i channel. Probability 
densities vs. and /i^ for rrit = 170 GeV are shown for (a) hs and (b) fs. Probability densities vs. mt and for (Tm = 30 GeV 
are shown for (c) hs and (d) fs- 



cause a shift in the relationship between the fitted and 
actual top quark masses. We test our fits and extract 
correction factors for any observed shifts by performing 
pseudoexperiments. 

A pseudoexperiment for each channel is a set of sim- 
ulated events of the same size and composition as the 
selected dataset given in Table ID We compose it by ran- 
domly drawing simulated events out of the large Monte 
Carlo event pool. Within a given pool, each Monte Carlo 
event has a weight based on production information and 
detector performance parameters. An example of the 
latter is the 6-tagging efficiency which depends on jet px 
and rj, and an example of former is the weight with which 
each event is generated by ALPGEN. We choose a random 
event, and then accept or reject it by comparing the event 



weight to a random number. In this way, our pseudoex- 
periments are constructed with the mix of events that 
gives the correct kinematic distributions. 

For MWT, we compose pseudoexperiments by drawing 
Monte Carlo events from signal and background samples 
with probabilities proportional to the numbers of events 
expected, fis and fif,. Thus, we draw events for each 
source based on a binomial probability. In the i^WT pseu- 
doexperiments, the number of background events of each 
source is Poisson fluctuated around the expected yields 
of Table IH The remaining events in the pseudoexperi- 
ment are signal events. If the sum of backgrounds totals 
more than A'obs, then the extra events are dropped and 
Us = 0. In this way, we do not use the tt production 
cross section, which is a function of mt. 
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FIG. 8: Probability density histogram ht and fit function fs for the i/WT method in the ejj. channel. Probability densities vs. 
CT„ and are shown for (a) hb and (b) /;,. 



To establish the relationship between the fitted top 
quark mass, m^*, and the actual generated top quark 
mass rrit , we assemble a set of many pseudoexperiments 
for each input mass. For the I'WT method, we choose 
300 pseudoexperiments because that is the point at which 
the Monte Carlo statistics begin to be oversampled, and 
all pseudoexperiments can still be considered statistically 
independent. We average m^* for each input rrit for each 
channel. We combine channels according to Eq. [T31 and 
we fit the dependence of this average mass on mt with 



,fit\ 



a{mt - 170 GeV) + /3 + 170 GeV. 



(20) 



The calibration points and fit functions are shown in 
Fig. [9] The results of the fits are summarized in Ta- 
ble [TTl Ideally, a and P should be unity and zero, re- 
spectively. The mf * of each pseudoexperiment and data 
measurement is corrected for the slopes and offsets given 
in Table HI] by 

^meas ^ ^-1 (^fit _ /J _ IJQ GcV) + 170 GcV. (21) 



The pull is defined as 
pull = 



mt 



(22) 



where a{m™'''^) = a~^(T{mf^) is the measured statisti- 
cal uncertainty after the calibration of Eq. [^T] The ideal 
pull distribution has a Gaussian shape with a mean of 
zero and a width of one. The pull widths from pseudo- 
experiments are given in Table HIl A pull width larger 
(less) than one indicates an underestimated (overesti- 
mated) statistical uncertainty. The uncertainty of the 
data measurement is corrected for deviations of the pull 
width from one as well as for the slope of the calibra- 
tion curve. The mean of the distribution of calibrated 



and pull-corrected statistical uncertainties yields the ex- 
pected statistical uncertainty (see Table |ll]) . Figure [10] 
shows the pull-width-corrected distribution of statistical 
uncertainties for the nit measurements from the ensem- 
ble testing. The expected uncertainty on the combined 
measurement for all channels is 5.1, 5.3, and 5.8 GeV for 
i^WT,,, vWTf, and MWT, respectively. 



VI. RESULTS 

The calibrated mass and statistical uncertainties for 
the 2i, i'-l-track, and their combination are shown in 
Table Hill for each of the three methods. The — ln£ fits 
from the vWT,,, z/WT/, and MWT methods, including 
data points, are shown in Fig. 1111 There are no data 
points for the :^WT y fit since the corresponding curve is 
a one-dimensional slice of an analytic three-dimensional 
fit function, /g. The calibrated statistical uncertainties 
determined in data from these likelihood curves are 
shown by arrows in Fig. [101 The statistical uncertainties 
agree with the expectations from ensemble testing. 

A. Systematic Uncertainties 

The top quark mass measurement relies substantially 
on the Monte Carlo simulation of ti signal and back- 
grounds. While we have made adjustments to this model 
to account for the performance of the detector, residual 
uncertainties remain. The limitations of modeling of 
physics processes may also affect the measured mass. 
There are several categories of systematic uncertainties: 
modeling of physics processes, modeling of the detector 
response, and the method. We have estimated each of 
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FIG. 9: The combined calibration curves corresponding to the (a) z/WT^, (b) vWTf, and (c) MWT methods. Overlaid is the 
result of the linear fit as defined in Eq. |20l The uncertainties are small and corresponding bars are hidden by the markers. 



TABLE II; Slope (a) and offset {(3) from the linear fit in Eq.[20]to the pseudoexperiment results of Fig. |9]for the 21, ^+track, 
and combined dilepton channel sets using the MWT and !/WT methods. 



Method 


Channel Slope: a 


Offset: 


/3 [GeV] 


Pull width 


Expected statistical 






















uncertainty [GeV] 




2£ 0.98 


± 


0.01 


-0.04 


± 


0.11 


1.02 


± 


0.02 


5.8 


J/WTh 


^+track 0.92 


± 


0.02 


2.28 


± 


0.27 


1.04 


± 


0.02 


13.0 


vWTh 


combined 0.99 


± 


0.01 


-0.04 


± 


0.11 


1.03 


± 


0.02 


5.1 


vWTf 


2£ 1.03 


± 


0.01 


-0.32 


± 


0.15 


1.06 


± 


0.02 


5.8 


i^WT/ 


f+track 1.07 


± 


0.03 


-0.04 


± 


0.37 


1.07 


± 


0.02 


12.9 


uWTf 


combined 1.04 


± 


0.01 


-0.45 


± 


0.13 


1.06 


± 


0.02 


5.3 


MWT 


2£ 1.00 


± 


0.01 


0.95 


± 


0.05 


0.98 


± 


0.01 


6.3 


MWT 


^+track 0.99 


± 


0.01 


0.64 


± 


0.12 


1.06 


± 


0.01 


13.8 


MWT 


combined 0.99 


± 


0.01 


0.97 


± 


0.05 


0.99 


± 


0.01 


5.8 



these as follows. 



1. Physics Modeling 

a. b fragmentation A systematic uncertainty arises 
from the different models of b quark fragmentation, 
namely the distribution of the fraction of energy taken 
by the heavy hadron. The standard DO simulation used 
for this analysis utilized the default pythia tune in the 
Bowler scheme [32j . We reweighted our tt simulated 
samples to reach consistency with the fragmentation 
model measured in e+e" ^ Z ^ bb decays [33|. A 
systematic uncertainty is assessed by comparing the 
measured mt in these two scenarios. 

b. Underlying event model An additional system- 
atic uncertainty can arise from the underlying event 
model. We compare measured top quark masses for the 
PYTHIA tune DW ^ with the nominal model (tune A). 



c. Extra jet modeling Extra jets in top quark events 
from gluon radiation can affect the tt px spectrum, and 
therefore the measured rrit- While our models describe 
the data within uncertainties for all channels, the ratio 
of the number of events with only two jets to those 
with three or more jets is typically four in the Monte 
Carlo and three in the data. To assess the affect of this 
difference, we reweight the simulated events with a top 
quark mass of 170 GeV so that this ratio is the same. 
Pseudoexperiments with reweighted events are compared 
to the nominal pseudoexperiments to determine the 
uncertainty. 

d. Event generator There is an uncertainty in event 
kinematics due to the choice of an event generator. This 
can lead to an uncertainty in the measured top quark 
mass. To account for variations in the accuracy of ti 
generators, we compare pseudoexperiment results using 
tt events generated with ALPGEN to those generated 
with PYTHIA for mt = 170 GeV. The difference between 
the two estimated masses is corrected by subtracting the 
expected statistical uncertainty divided by the square 
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FIG. 10: Distribution of statistical uncertainties for top quark mass measurements of pseudoexperiments for the combination of 

all channels for simulated events with nit = 170 GeV for the (a) i^WTh, (b) i^WT/, and (c) MWT methods. The uncertainties 
are corrected by the calibration curve and for the pull width. The arrows indicate the statistical uncertainties for the measured 
top quark mass. 



TABLE 111: Calibrated fitted mt for the lyWTh, vWTf, and MWT methods. All uncertainties are statistical. 



Channel i^WT^ [GeV] uWTf [GeV] MWT [GeV] 

2e 177.5 ± 5.5 176.1 ± 5.8 176.6 ± 5.5 

£+track 170.7 ± 12.3 174.6 ± 13.8 165.0 ± 8.5 

combined 176.3 ± 4.9 176.0 ± 5.3 173.2 ± 4.9 



root of the number of pseudoexperiments. 

e. PDF variations The top quark mass measure- 
ment relics on Monte Carlo events generated with a 
particular PDF set (CTEQ6L1). Moreover, this PDF 
set is used directly by the MWT method. We estimate 
the resulting uncertainty on nit by reweighting the 
Monte Carlo according to the different eigenvectors of 
the CTEQ6L1 PDFs. For each choice, we measure a 
new mass and the difference between the mass obtained 
with reweighting and a nominal mass is computed. The 
resulting uncertainty is the sum in quadrature of all 
above uncertainties. 

/. Background template shape The uncertainties 
on the background kinematics can affect the template 
shapes and consequently the measured top quark mass. 
The uncertainty from the background template shape 
is found by substituting simulated WW events for all 



Z backgrounds (including Z tt) in all pseudoex- 
periments. The uncertainty is taken as the difference 
between the average measured top quark mass with this 
assumption and the nominal value. 



2. Detector Modeling 

a. Jet energy scale Because the 6 jets carry the 
largest share of the energy in top quark events, their 
calibration has the largest affect on the uncertainty on 
mt- Ideally, the procedure to calibrate jet energies in 
data and Monte Carlo achieves the same energy scale 
in both. However, each procedure yields a systematic 
uncertainty. We estimate the resulting uncertainty by 
repeating the pseudoexperiments with simulated events 
in which the jet energies are shifted up and down by 
the known pT and rj dependent uncertainty, obtained by 
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FIG. 11: Negative log-likelihood —InjC vs. mt for the combination of all channels before calibration for the (a) fWTh, (b) 
i^WTf, and (c) MWT methods. 
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summing in quadrature uncertainties on the data and 
Monte Carlo jet energy scales. The probability density 
histograms {hg, ht) and functions (/s, ft) are left with 
the nominal calibration. 

b. b /light quark response ratio This uncertainty 
arises from the fact that the jets in signal events 
are primarily b jets. These have a different detector 
response than the light quark and gluon jets which 
dominate the 7+jet sample used to derive the overall 
jet energy calibration. By applying this calibration to 
the b jet sample, a 1.8% shift in jet pr is observed [35| . 
We adjust the jets in the Monte Carlo for this and 
propagate the correction into the ^7-. This causes a 
shift in the measured mt which is taken as an uncertainty. 

c. Sample dependent jet energy scale After the 
initial calibration, a residual shift in jet px distributions 
is observed in Z+jets events when comparing data 
and Monte Carlo. We adopt a further calibration that 
improves agreement in these distributions and apply it to 
all of our background samples. Because this correction 
may not apply to ti events, we take the shift in the 
measured mt to be a systematic uncertainty. 

d. Object resolution The jet resolution from the 
simulation is better than that observed in data. To 
improve the agreement, we apply an additional smearing 
to Monte Carlo events. A residual difference between 
data and Monte Carlo jet resolutions can lead to a mass 
bias. To estimate the effect on the mt measurement, we 
repeat the pseudoexperiments by adjusting this smearing 
up and down within its uncertainty while keeping 
and hb with the nominal resolutions. We estimate the 
systematic uncertainties arising from the muon, isolated 
track, and electron px resolutions in a similar way. 

e. Jet identification Jet reconstruction and iden- 
tification efficiency in Monte Carlo is corrected to 
match the data. We propagate the uncertainty on the 
correction factor to the top quark mass measurement. 

/. Monte Carlo corrections Residual uncertainties 
on the Monte Carlo corrections exist for triggering, 
luminosity profiles, lepton identification, and 6-tagging. 
These uncertainties affect the top quark mass uncertain- 
ties. In each case, a respective systematic uncertainty 
on mt is found by reweighting events according to 
the uncertainties of Monte Carlo correction factors 
introduced to reproduce the data. 



3. Method 

a. Background yield Due to the limited statistics 
of the simulation, there is some uncertainty in the event 
yields for the background processes. This uncertainty 



affects the likelihood and the measured top quark mass 
via Eq. [T^ To estimate the effect of the uncertainty on 
background event yield, we vary the total background 
yield by its known uncertainty up and down keeping 
the relative ratios of individual background processes 
constant. 

b. Template statistics The templates used in the 
MWT and vW^t methods have finite statistics. Local 
fluctuations in these templates can cause local fluctua- 
tions in the individual likelihood fits and the top quark 
mass. We obtain an uncertainty in mt by varying the 
— \nC points from the data ensemble within their errors. 
The width of the mt distribution provides the systematic 
uncertainty. 

For the vWT f method, the fs function depends on 
fifteen parameters, each of which has a corresponding 
uncertainty. Consequently, there is some uncertainty 
on the shape of this function. There is a corresponding 
uncertainty on the parameters of the fb- The uncertainty 
on the shape causes the fit uncertainty on the measured 
top quark mass. We find the impact of this uncertainty 
on the data sample by varying the parameters of fs and 
fb within their uncertainties. For each such variation, we 
remeasure mt for the data sample. The fit uncertainty 
is the width of this distribution. 

c. Monte Carlo calibration There is an uncertainty 
on fitting the parameters (slope and offset) of the cali- 
bration curve. This uncertainty causes an uncertainty 
in the calibrated top quark mass. The calibration 
uncertainty is obtained as the uncertainty of the offset. 

A summary of estimated systematic errors for the com- 
bined dilepton channels is provided in Table [TVl We as- 
sumed the systematic uncertainties for all three methods 
to be completely correlated for each source of systematic 
uncertainty and uncorrelated among different sources. 
The correlations of statistical uncertainties are given in 
the next Section. All uncertainties are corrected for the 
slope of the mass scale calibration. The total uncertainty 
is found by assuming all the contributions are indepen- 
dent and adding them in quadrature. 



B. Combined Results 

Because the statistical use of the vWT moments tem- 
plate is different between the vWTh and vWT f meth- 
ods, ensemble tests show that these two measurements 
are only 85% correlated. We combine them using the 
best linear unbiased estimator (BLUE) method to 
form a final z^WT measurement. Applying the correlation 
to the measurements from data, we obtain a final vWT 
measurement of mt — 176.2 ± 4.8(stat) ± 2.1(sys) GeV 
for the combination of all five channels. We treat all 
systematic uncertainties as 100% correlated across meth- 
ods except for the Monte Carlo calibration and template 
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TABLE IV: Summary of systematic uncertainties for the combined analysis of all dilepton channels. The z/WT^, I'WT/, and 
MWT method results are shown. 
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statistics uncertainties. These are treated as uncorrelated 
and 85% correlated, respectively. The individual system- 
atic uncertainties on the lyWT combination are the same 
as those for the j/WT^ method to the precision given in 
Table IVl 

The final MWT measurement is mt = 173.2 ± 
4.9(stat) ± 2.0(sys) GeV for the combination of all 
dilepton channels. The total systematic uncertainties 
are 2.0 GeV and 2.4 GeV for the 21 and ^+track MWT 
results, respectively. The vWT and MWT approaches 
use partially different information from each tt event; the 
two results are measured to be 61% correlated. There- 
fore, we use the BLUE method to determine an overall 
measurement of mt — 174.7 ± 4.4(stat) ± 2.0(sys) GeV 
for the combination of all dilepton channels. We treat 
all systematic uncertainties as 100% correlated across 
methods except for two uncertainties. The Monte 
Carlo calibration and template statistics systematic 
uncertainties are treated as uncorrelated and 61% 
correlated, respectively. The channel-specific results 
for both measurement combinations are given in Table [V] 



VII. CONCLUSIONS 

In 1 fb^^ of collision data from the Fermilab Teva- 
tron collider, we employed two mass extraction methods 
to measure nit in tt events with two high px final state 
leptons. We analyzed three channels with two fully iden- 
tified leptons (e/i, ee, and iijj) and two channels with 
relaxed lepton selection and a ^-tagged jet (e-|-track and 
/i-|-track). Using the I'WT event reconstruction, we per- 
form a maximum likelihood fit to the first two moments 



of the resulting distribution of relative weight vs. mt to 
measure 

mt = 176.2±4.8(stat)±2.1(sys)GeV. 

(23) 

We also employ the MWT method using a fit to the mass 
giving the maximum weight. We measure 

mt = 173.2±4.9(stat)±2.0(sys)GeV. 

(24) 

Accounting for correlations in these two measurements, 
we obtain a final combined result of 

mt = 174.7±4.4(stat)±2.0(sys)GeV. 

(25) 

Our result is consistent with the current world average 
value of mt [37]. 
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TABLE V: Final results for the !/WT method and !/WT+MWT combination. 



Channel i^WT [GeV] i^WT+MWT [GeV] 

2£ 177.1 ± 5.4(stat) ± 2.0(sys) 176.9 ± 4.8(stat) ± 1.9(sys) 

£+track 171.2 ± 12.3(stat) ± 2.7(sys) 165.7 ± 8.4(stat) ± 2.4(sys) 

combined 176.2 ± 4.8(stat) =b 2.1(sys) 174.7 ± 4.4(stat) ± 2.0(sys) 
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